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BACKGROUND 

10 This invention relates to interferometes, e.g., displacement measuring and dispersion 

interferometers that measure displacements of a measurement object such as a mask stage or 
a wafer stage in a lithography scanner or stepper system, and also interferometers that 
monitor wavelength and determine intrinsic properties of gases. 

Displacement measuring interferometers monitor changes in the position of a 

15 measurement object relative to a reference object based on an optical interference signal. 

The interferometer generates the optical interference signal by overlapping and interfering a 
measurement beam reflected from the measurement object with a reference beam reflected 
from the reference object. 

In many applications, the measurement and reference beams have orthogonal 

20 polarizations and different frequencies. The different frequencies can be produced, for 

example, by laser Zeeman splitting, by acousto-optical modulation, or internal to the laser 
using birefringent elements or the like. The orthogonal polarizations allow a polarizing beam 
splitter to direct the measurement and reference beams to the measurement and reference 
objects, respectively, and combine the reflected measurement and reference beams to form 

25 overlapping exit measurement and reference beams. The overlapping exit beams form an 

output beam that subsequently passes through a polarizer. The polarizer mixes polarizations 
of the exit measurement and reference beams to form a mixed beam. Components of the exit 
measurement and reference beams in the mixed beam interfere with one another so that the 
intensity of the mixed beam varies with the relative phase of the exit measurement and 

30 reference beams. A detector measures the time-dependent intensity of the mixed beam and 

l 
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generates an electrical interference signal proportional to that intensity. Because the 

measurement and reference beams have different frequencies, the electrical interference 

signal includes a "heterodyne" signal having a beat frequency equal to the difference 

between the frequencies of the exit measurement and reference beams. If the lengths of the 

5 measurement and reference paths are changing relative to one another, e.g., by translating a 

stage that includes the measurement object, the measured beat frequency includes a Doppler 

shift equal to 2vnp/X , where v is the relative speed of the measurement and reference 

objects, X is the wavelength of the measurement and reference beams, n is the refractive 

index of the medium through which the light beams travel, e.g., air or vacuum, and p is the 

1 0 number of passes to the reference and measurement objects. Changes in the relative position 
of the measurement object correspond to changes in the phase of the measured interference 
signal, with a 2n phase change substantially equal to a distance change L RT of X l{np) , 
where L RT is a round-trip distance change, e.g., the change in distance to and from a stage 
that includes the measurement object. 

1 5 Unfortunately, this equality is not always exact. Many interferometers include 

nonlinearities such as what are known as "cyclic errors." The cyclic errors can be expressed 
as contributions to the phase and/or the intensity of the measured interference signal and have 
a sinusoidal dependence on the change in optical path length pnL RT . For example, a first 
order harmonic cyclic error in phase has a sinusoidal dependence on {2npnL RT )l X and a 

20 second order harmonic cyclic error in phase has a sinusoidal dependence on 2 {2npnL RT )l /t . 
Additional cyclic errors may include higher order harmonic cyclic errors, negative order 
harmonic cyclic errors, and sub-harmonic cyclic errors. 

Cyclic errors can be produced by "beam mixing," in which a portion of an input beam 
that nominally forms the reference beam propagates along the measurement path and/or a 

25 portion of an input beam that nominally forms the measurement beam propagates along the 
reference path. Such beam mixing can be caused by ellipticity in the polarizations of the 
input beams and imperfections in the interferometer components, e.g., imperfections in a 
polarizing beam splitter used to direct orthogonally polarized input beams along respective 
reference and measurement paths. Because of beam mixing and the resulting cyclic errors, 

30 there is not a strictly linear relation between changes in the phase of the measured 

2 
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interference signal and the relative optical path length pnL between the reference and 

measurement paths. If not compensated, cyclic errors caused by beam mixing can limit.the 

accuracy of distance changes measured by an interferometer. Cyclic errors can also be 

produced by imperfections in transmissive surfaces that produce undesired multiple 

5 reflections within the interferometer and imperfections in components such as retroreflectors 

and/or phase retardation plates that produce undesired ellipticities in beams in the 

interferometer. For a general reference on the theoretical cause of cyclic error, see, for 

example, C.W. Wu and R.D. Deslattes, "Analytical modelling of the periodic nonlinearity in 

heterodyne interferometry," Applied Optics, 37, 6696-6700, 1998. 

10 In dispersion measuring applications, optical path length measurements are made at 

multiple wavelengths, e.g., 532 ran and 1064 nm, and are used to measure dispersion of a gas 
in the measurement path of the distance measuring interferometer. The dispersion 
measurement can be used to convert the optical path length measured by a distance 
measuring interferometer into a physical length. Such a conversion can be important since 

1 5 changes in the measured optical path length can be caused by gas turbulence and/or by a 
change in the average density of the gas in the measurement arm even though the physical 
distance to the measurement object is unchanged. In addition to the extrinsic dispersion 
measurement, the conversion of the optical path length to a physical length requires 
knowledge of an intrinsic value of the gas. The factor T is a suitable intrinsic value and is 

20 the reciprocal dispersive power of the gas for the wavelengths used in the dispersion 
interferometry. The factor T can be measured separately or based on literature values. 
Cyclic errors in the interferometer also contribute to dispersion measurements and 
measurements of the factor T . In addition, cyclic errors can degrade interferometric 
measurements used to measure and/or monitor the wavelength of a beam. 

25 The interferometers described above are often crucial components of scanner systems 

and stepper systems used in lithography to produce integrated circuits on semiconductor 
wafers. Such lithography systems typically include a translatable stage to support and fix the 
wafer, focusing optics used to direct a radiation beam onto the wafer, a scanner or stepper 
system for translating the stage relative to the exposure beam, and one or more 

30 interferometers. Each interferometer directs a measurement beam to, and receives a reflected 
measurement beam from, a plane mirror attached to the stage. Each interferometer interferes 
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its reflected measurement beams with a corresponding reference beam, and collectively the 
interferometers accurately measure changes in the position of the stage relative to the 
radiation beam. The interferometers enable the lithography system to precisely control 
which regions of the wafer are exposed to the radiation beam. 
5 In practice, the interferometry systems are used to measure the position of the wafer 

stage along multiple measurement axes. For example, defining a Cartesian coordinate 
system in which the wafer stage lies in the x-y plane, measurements are typically made of the 
x and y positions of the stage as well as the angular orientation of the stage with respect to 
the z axis, as the wafer stage is translated along the x-y plane. Furthermore, it may be 
10 desirable to also monitor tilts of the wafer stage out of the x-y plane. For example, accurate 
characterization of such tilts may be necessary to calculate Abbe offset errors in the x and y 
positions. Thus, depending on the desired application, there may be up to five degrees of 
freedom to be measured. Moreover, in some applications, it is desirable to also monitor the 
position of the stage with respect to the z-axis, resulting in a sixth degree of freedom. 

15 SUMMARY 

Among other aspects, the invention features electronic processing methods that 
characterize and compensate cyclic errors in interferometric data. Because cyclic errors are 
compensated electronically, the interferometry system that produces the data has greater 
tolerance to optical, mechanical, and electronic imperfections that can cause cyclic errors, 

20 without sacrificing accuracy. The compensation techniques are especially useful for 
interferometric data used to position microlithographic stage systems. 

In part, the invention is based on the realization that prior values of a main 
interferometric signal can be used to calculate an estimate for a quadrature signal for the 
main interferometric signal, and that algebraic combinations of such signals can yield 

25 sinusoidal functions whose time-varying arguments correspond to particular cyclic error 
terms. Hereinafter such functions are sometimes referred to as error basis functions. In 
embodiments in which the interferometer beams have a heterodyne frequency splitting, one 
may also calculate the quadrature signal of the heterodyne reference signal, and the error 
basis functions maybe derived from algebraic combinations the main signal, the reference 

30 signal, and the quadrature signals of the main and reference signals. 
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The error basis functions are used to isolate particular cyclic error terms in the main 
signal and characterize coefficients representative each cyclic error term (e.g., its amplitude 
and phase). For example, algebraic combinations of the error basis functions and the main 
signal and its quadrature signal can move a selected cyclic error term to zero-frequency, 
5 where low-pass filtering techniques (e.g., averaging) can be used to determine its amplitude 
and phase. Such coefficients are stored. Thereafter, a superposition of the error basis 
functions weighted by the stored coefficients can be used to generate an error signal that can 
be subtracted from the main signal to reduce the cyclic errors therein and improve its 
accuracy. 

10 The technique is particularly useful when the Doppler shift is small relative to the 

heterodyne frequency because the frequency of each cyclic error term is nearly equal to that 
of primary component of the main signal, in which case the estimate for the quadrature signal 
of the main signal is more accurate. This is an especially important property because it is 
precisely when the frequencies of the cyclic error terms are near that of the primary 

15 component of the main signal that the cyclic error terms are most problematic because they 
cannot be remove by frequency filtering techniques. Furthermore, at small Doppler shifts, 
one or more of the cyclic error frequencies may be within the bandwidth of a servo system 
used to position a stage based on the interferometric signal, in which the case the servo loop 
may actually amplify the cyclic error term when positioning the stage. Small Doppler shifts 

20 are actually quite common in microlithographic stage systems, such as when searching for an 
alignment mark, scanning in an orthogonal dimension to the one monitored by the 
interferometric signal, and changing stage direction. Moreover, at small Doppler shifts, 
selecting an integral relationship between the sampling rate of the detector and the 
heterodyne frequency (e.g., 6:1) yields an especially simple formula for the quadrature 

25 signal. 

In addition, at small Doppler shifts, the main signal is nearly periodic with the 
heterodyne frequency, in which case prior data can be used to generate the error signal. As a 
result, correction of the main signal can be accomplished with only a single real-time 
subtraction of the error signal from the main signal, significantly reducing the computation 
30 time associated with the correction and thereby reducing data age errors in any servo system 
for position a microlithography stage. 

5 
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We now summarize various aspects and features of the invention. 
In general, in one aspect, the invention features a first method including: (i) providing 
an interference signal S(t) from two beams derived from a common source and directed 
along different paths, wherein the signal S(t) is indicative of changes in an optical path 
5 difference nL{t) between the different paths, where n is an average refractive index along 
the different paths, L(t) is a total physical path difference between the different paths, and t 
is time; (ii) providing coefficients representative of one or more errors that cause the signal 
S (t) to deviate from an ideal expression of the form A l cos (ay + <p (t) + ) , where A ] and 
are constants, (o R is an angular frequency difference between the two beams, and 
10 <p(t) = nkL{t) , with k = 2n\X and A equal to a wavelength for the beams; (iii) calculating a 
quadrature signal S(t) based on the signal S (t); and (iv) reducing the deviation of S (t) 
from the ideal expression using an error signal (t) generated from the coefficients and 
error basis functions derived from the signals S(t) and S(t) . 

Embodiments of the second method may including any of the following features. 
15 The method may further include directing the two beams along the different paths and 

measuring the interference signal S (f ) . For example, at least one of the beams may be 
directed to reflect from a movable measurement object before producing the interference 
signal S(t) . Furthermore, the beam directed to contact the movable measurement object 
may reflect from the measurement object multiple times before producing the interference 
20 signal S(t) . Also, the beams may be directed to reflect from different locations of the 
movable measurement object before producing the interference signal S(t). 
The errors may correspond to spurious beam paths. 

The deviation may be expressed as ^A m p cos^a> K t +— <p(t) + g m p j, where p=\, 2, 

3..., and m is any integer not equal to p, and where the provided coefficients include 
25 information corresponding to at least some of A m p and ^ m p . 

The angular frequency difference co R may be non-zero. 
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The method may further include: providing a reference signal 

S R (t) = A R cos (o} R t + ^ R ), where A R and £ R are constants; and calculating a quadrature 

reference signal S R (t) based on the signal S R (/) , wherein the error basis functions are 
derived from the signals S(t), S(t), S R (t) , and S R (t) . For example, the method may 
5 further include measuring the reference signal S R (/) based on an output from the common 
source. 

Calculating the quadrature signal S(t) may include calculating the quadrature signal 

S(t) based on the expression S (t) = ( coteo M r)S (t -2r) - C ° s2o)mT S(t-r) , where r>0 
w smo) M T 

and o) M is an instantaneous rate of change of a phase of the interference signal S (/■) . For 

10 example, calculating the quadrature signal S(t) may further include approximating a> M 

according to co M » co R +d<p(t)/dt , where q>(t) in the expression for a> M is determined from 

the interference signal S(t) assuming the deviation of S(t) from the ideal expression is 

negligible. Alternatively, calculating the quadrature signal S{t) may further include 

approximating a> M according to co M » co R . In the latter case, calculating the quadrature 

15 signal S(t) may include calculating the quadrature signal S(t) according to the simplified 

expression S(t) = -^=\_S (t -t) + S (t -2t)~] for r = (x + 6xN)/3a> R , where is a non- 
V3 

negative integer. 

Calculating the quadrature reference signal S R (t) may include calculating the 
quadrature reference signal S R (t) based on the expression 
20 S R (t) = (cot co R r)S R (t-2r)- COs20)RT S 1{ (t-r), where t >0. For example, calculating the 
quadrature reference signal S R (t) may include calculating the quadrature reference signal 
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S R (/) according to the simplified expression S R (t) = -j=\_S R (t - r) + S R (/ - 2r)l for 

r = (n + 6kN)/2>co r , where iV is a non-negative integer. 

The interference signal S(t) may be provided at a data rate that is an integer multiple 
of co R j2n . 

5 The error basis functions may correspond to one or more pairs of sine and cosine 

functions having an argument whose time-varying component has the form 
co K t + (m/ p)<p(t) , where p is a positive integer and m is an integer not equal to p. In 
particular, the error basis functions may correspond to multiple pairs of the sine and cosine 
functions. For example, the error basis functions may include multiple pairs of the sine and 
10 cosine functions from a family of the sine and cosine functions with 

{(p = \,m = -\),(p = l,m = 0),(p = l,m = 2),(p = l,m = 3),and(p = 2,m = l)}. 

The method may further include generating the error basis functions from the signals 
S(t) and S(t). 

The method may further include generating the error basis functions from the signals 
15 S(t) , S(t) , S R (t) , and S R (/) . For example, the error basis functions may be generated 
from algebraic combinations of the signals S(t), S(t), S R (t), and S R (/) . 

The method may further include generating the error signal S ¥ (t) . For example, the 
error signal (t) may be generated from a superposition of the error basis functions 
weighted by the coefficients representative of the errors. 
20 Reducing the deviation may include subtracting the error signal (t) from the 

interference signal S(t). 

The method may further include determining a value for the optical path difference 
nL(t) from the interference signal S(t) after its deviations are reduced. 

The quadrature signal S(t) may be calculated from the interference signal S(t) 
25 based on prior values of S (t) according to the approximation S (t) « S (t - InNf co R ) , where 
N is a positive integer. 

8 
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The error basis functions used to generate the error signal (/) may be derived from 
prior values of the signals S(t) and S (/) according to the approximations 
5 (t) » S (t - 2nN/a> R ) and S (t) « S (t - 2nMjco R ) , where N and M are positive integers. 

The angular frequency difference may satisfy co R > 100 -d<p(t)/dt . Furthermore, it 
may satisfy co R > 500-d<p(t)jdt . 

In general, in another aspect, the invention features a second method, which includes: 
(i) providing an interference signal S(t) from two beams derived from a common source and 

directed along different paths, wherein the signal S(t) is indicative of changes in an optical 
path difference nL (t) between the different paths, where n is an average refractive index 
along the different paths, L(t) is a total physical path difference between the different paths, 
and t is time; (ii) providing coefficients representative of one or more errors that cause the 
signal S (t) to deviate from an ideal expression of the form A i cos (a> R t + <p(t) + £ x }, where 
A x and are constants, co R is an angular frequency difference between the two beams, and 
<p(t) = nkL{t) , with k - and X equal to a wavelength for the beams; and (iii) reducing 
the deviation of S{t) from the ideal expression using an error signal S v (7) generated from 
the coefficients and error basis functions derived from the interference signal S (t) based on 
prior values of S (7) according to the approximation S (t) » S (t - 2nN/ co R ) , where N is a 
positive integer. 

Embodiments of the second method may include any of the following features. 

The angular frequency difference may satisfy co R > 100 -d<p(t)/dt . 

The method may further include providing a reference signal 
S R (^) = A R cos(a> R t + £ R ), where co R is non-zero and A R and £ k are constants, and wherein 
the error basis functions are derived from the signals S (/) and S R (t) . Furthermore, the 
derivation of the error basis functions may be based on prior values of S R (t) according to 
S R (t) = S R (t- 2nMj a> R ) , where M is a positive integer. 
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Embodiments of the second method may further include any of the features described 
above in connection with the first method. 

In general, in another aspect, the invention features a third method. The third method 
is for estimating coefficients representative of one or more errors that cause an interference 
signal S (t) from two beams derived from a common source and directed along different 
paths to deviate from an ideal expression of the form A x cos(<sy + cp{t) + ) , wherein the 
signal S(t) is indicative of changes in an optical path difference nL(t) between the different 
paths, where n is an average refractive index along the different paths, L{t) is a total 
physical path difference between the different paths, t is time, A x and £ are constants, eo R is 
an angular frequency difference between the two beams, and <p(t) = nkL (t) , with k = 2n/X 
and X equal to a wavelength for the beams. The third method includes: (i) calculating a 
quadrature signal S(t) based on the signal S(t) ; and (ii) calculating an estimate for the 
coefficients based on the signals S(t) and S(t). 

Embodiments of the third method may further include any of the following features. 

The method may further include: providing a reference signal 
S R (t) = A R cos (o) R t + C R ), where cd k is non-zero and A R and C, R are constants; and 

calculating a quadrature reference signal S R (t) based on the signal S R (t) , wherein the 

estimate for the coefficients is based on the signals S(t), S(t), S R (t) , and S R (t) . 

Calculating the estimate for the coefficients may include generating error basis 
functions derived from the signals S(t) and S(t) . 

Calculating the estimate for the coefficients may include generating error basis 
functions derived from the signals S(t) , S (/) , S R (/) , and S R (r) . For example, the error 
basis functions may correspond to one or more pairs of sine and cosine functions having an 
argument whose time- varying component has the form G) R t + (m/ p)q>{t) , where p is a 
positive integer and m is an integer not equal to p. Furthermore, the error basis functions 
may be generated from algebraic combinations of the signals S(t), S(t), S R (t) , and S R (t) . 
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Also, the error basis functions may correspond to multiple pairs of the sine and cosine 
functions. For example, the error basis functions may include multiple pairs of the sine and 
cosine functions from a family of the sine and cosine functions with 
{(p = l,m = -l),(p = \,m = 0),(p = l,m = 2),(p = l,m = 3),and(p = 2,m = l)}. 
5 Calculating the estimate for the coefficients may include low-pass filtering algebraic 

combinations of the error basis functions and the signals S(t) and S(t) . 

Calculating the estimate for the coefficients may include low-pass filtering algebraic 
combinations of the error basis functions and the signals S (t) and S(t) . 

For example, the low-pass filtering may include using a Butterworth filter. 
1 0 Embodiments of the third method may further include any of the features described 

above in connection with the first method. 

In another aspect, the invention features an apparatus including a computer readable 
medium which during operation causes a processor to perform any of the first, second, or 
third methods. 

15 In general, in another aspect the invention features a first apparatus including: (i) an 

interferometry system which during operation directs two beams derived from a common 
source along different paths and provides an interference signal S(t) from the two beams, 
wherein the signal S(t) is indicative of changes in an optical path difference nL{t) between 
the different paths, where n is an average refractive index along the different paths, L{t) is a 

20 total physical path difference between the different paths, and t is time, wherein 

imperfections in the interferometry system produce one or more errors that cause the signal 
S (/) to deviate from an ideal expression of the form A x cos [a> R t + (p (/) + ^, ) , where A 1 and 

are constants, co R is an angular frequency difference between the two beams, and 
q>(t) - nkL(t) , with k = 27t/2. and X equal to a wavelength for the beams; and (ii) an 

25 electronic processor which during operation receives the interference signal S (t) from the 
interferometry system, receives coefficients representative of the one or more errors, 
calculates a quadrature signal S(t) based on the signal S(t), and reduces the deviation of 
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S(t) from the ideal expression using an error signal (t) generated from the coefficients 

and error basis functions derived from the signals S(t) and S(t) . 

Embodiments of the first apparatus may include features corresponding to any of the 
features described above in connection with the first method. 

In general, in another aspect, the invention features arm apparatus including: (i) an 
interferometry system which during operation directs two beams derived from a common 
source along different paths and provides an interference signal S(t) from the two beams, 
wherein the signal S(t) is indicative of changes in an optical path difference nL{t) between 
the different paths, where n is an average refractive index along the different paths, L{t) is a 
total physical path difference between the different paths, and t is time, wherein 
imperfections in the interferometry system produce one or more errors that cause the signal 
S(t) to deviate from an ideal expression of the form A l cos (ay + (p(t) + ) , where A x and 

are constants, co R is an angular frequency difference between the two beams, and 
<p(t) = nkL{t) , with k = In/X and /I equal to a wavelength for the beams; and (ii) an 
electronic processor which during operation receives the interference signal S{t) from the 
interferometry system, receives coefficients representative of the one or more errors, and 
reduces the deviation of S[t) from the ideal expression using an error signal S ¥ (t) 
generated from the coefficients and error basis functions derived from the interference signal 
S{t) based on prior values of S(t) according to the approximation S(t) « S{t -2zN / co R ) , 
where N is a positive integer. 

Embodiments of the second apparatus may include features corresponding to any of 
the features described above in connection with the second method. 

In general, in another aspect, the invention features a third apparatus including: (i) an 
interferometry system which during operation directs two beams derived from a common 
source along different paths and provides an interference signal S(t) from the two beams, 
wherein the signal S(t) is indicative of changes in an optical path difference nL{t) between 
the different paths, where n is an average refractive index along the different paths, L{t) is a 

12 
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total physical path difference between the different paths, and / is time, wherein 
imperfections in the interferometry system produce one or more errors that cause the signal 
S(t) to deviate from an ideal expression of the form A x cos (ay + (p{t) + ) , where A t and 

are constants, a> R is an angular frequency difference between the two beams, and 
<p(t) = nkL (t) , with k - 2nfX and A equal to a wavelength for the beams; and (ii) an 
electronic processor which during operation receives the interference signal S(t) from the 
interferometry system, calculates a quadrature signal S(t) based on the signal S(t), and 
calculates an estimate for the coefficients representative of the one or more errors based on 
the signals S(t) and S(t). 

Embodiments of the third apparatus may include features corresponding to any of the 
features described above in connection with the third method. 

In another aspect, the invention features a lithography system for use in fabricating 
integrated circuits on a wafer, the system including a stage for supporting the wafer, an 
illumination system for imaging spatially patterned radiation onto the wafer, a positioning 
system for adjusting the position of the stage relative to the imaged radiation, and any of the 
first, second, and third apparatus for monitoring the position of the wafer relative to the 
imaged radiation. 

In another aspect, the invention features a lithography system for use in fabricating 
integrated circuits on a wafer, the system including a stage for supporting the wafer, and an 
illumination system including a radiation source, a mask, a positioning system, a lens 
assembly, and any of the first, second, and third apparatus, wherein during operation the 
source directs radiation through the mask to produce spatially patterned radiation, the 
positioning system adjusts the position of the mask relative to the radiation from the source, 
the lens assembly images the spatially patterned radiation onto the wafer, and the apparatus 
monitors the position of the mask relative to the radiation from the source. 

In a further aspect, the invention features a beam writing system for use in fabricating 
a lithography mask, the system including a source providing a write beam to pattern a 
substrate, a stage supporting the substrate, a beam directing assembly for delivering the write 
beam to the substrate, a positioning system for positioning the stage and beam directing 
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assembly relative one another, and any of the first, second, and third apparatus for monitoring 
the position of the stage relative to the beam directing assembly. 

In another aspect, the invention features a lithography method for use in fabricating 
integrated circuits on a wafer, the method including supporting the wafer on a moveable 
5 stage, imaging spatially patterned radiation onto the wafer, adjusting the position of the stage, 
and monitoring the position of the stage using any of the first, second, and third methods. 

In a further aspect, the invention features a lithography method for use in the 
fabrication of integrated circuits including directing input radiation through a mask to 
produce spatially patterned radiation, positioning the mask relative to the input radiation, 
1 0 monitoring the position of the mask relative to the input radiation using any of the first, 
second, and third methods, and imaging the spatially patterned radiation onto a wafer. 

In yet another aspect, the invention features a lithography method for fabricating 
integrated circuits on a wafer including positioning a first component of a lithography system 
relative to a second component of a lithography system to expose the wafer to spatially 
1 5 patterned radiation, and monitoring the position of the first component relative to the second 
component using any of the first, second, and third methods. 

In a further aspect, the invention features a method for fabricating integrated circuits, 
the method including a foregoing lithography method. 

In another aspect, the invention features a method for fabricating integrated circuits, 
20 the method including using a foregoing lithography system. 

In yet another aspect, the invention features a method for fabricating a lithography 
mask, the method including directing a write beam to a substrate to pattern the substrate, 
positioning the substrate relative to the write beam, and monitoring the position of the 
substrate relative to the write beam using any of the first, second, and third methods. 
25 Unless otherwise defined, all technical and scientific terms used herein have the same 

meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs. In case of conflict with publications, patent applications, patents, and other 
references mentioned incorporated herein by reference, the present specification, including 
definitions, will control. 

30 Other features, objects, and advantages of the invention will be apparent from the 

following detailed description. 
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DESCRIPTION OF DRAWINGS 

FIG. la is a schematic diagram of a processing unit for generating cyclic error basis 
functions and characterizing cyclic error coefficients based on a main interference signal 
S(t) and a reference signal S R (/) . 
5 FIG. lb is a schematic diagram of a processing unit for generating an error signal 

S v (t) from the cyclic error basis functions and characterized coefficients and using the error 
signal to reduce cyclic errors in the main interference signal S (t) . 

FIG. 2 is a schematic diagram of an M ,h order digital filter for use in low-pass 
filtering algebraic combinations of the main signal, the reference signal, their quadrature 
10 signals, and the error basis functions to yield the cyclic error coefficients. 

FIG. 3 is a schematic diagram of an interferometry system including a high-stability 
plane mirror interferometer (HSPMI). 

FIG. 4 is a schematic diagram of an embodiment of a lithography tool that includes an 
interferometer. 

15 FIG. 5(a) and FIG 5(b) are flow charts that describe steps for making integrated 

circuits. 

FIG 6 is a schematic of a beam writing system that includes an interferometry 

system. 

Like reference symbols in the various drawings indicate like elements. 

20 DETAILED DESCRIPTION 

Embodiments include an electronic cyclic error compensation procedure (CEC) for 
compensation of cyclic error effects in interferometry applications, such as heterodyne 
interferometry. In preferred embodiments, the compensation is achieved for low slew rates 
of a plane mirror measurement object attached to a stage or attached to a reference system 
25 with associated interferometers attached to a stage. When optical techniques are used to 

eliminate and/or reduce the amplitudes of certain cyclic errors such as sub-harmonic cyclic 
errors to < 0.05 nm (3a), the remaining cyclic errors of the harmonic type with amplitudes of 

0.5 nm or less can be treated as having constant amplitudes with fixed offset phases and the 
required accuracy of the cyclic error compensation for a remaining cyclic error term is 

15 
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approximately 10 % to meet a compensated cyclic error budget of 0.05 ran (3a) or less. 
Further, the number of cyclic error terms that need to be compensated electronically are 
typically a small number, e.g., of the order of 3. In preferred embodiments, the processing 
operations of CEC requiring higher digital processing rates can be limited to a single add 
5 operation, whereas the remaining processing operations, which require additions, 

subtractions, multiplications, and divisions, can be performed at lower rates using prior 
values of the interference signal. 

Typically, cyclic error effects in heterodyne interferometry can be eliminated by 
filtering the heterodyne signal in frequency space, e.g., using Fourier spectral analysis, when 

10 the Doppler shift frequencies can be resolved by the frequency resolution of a phase meter 
used to determine the heterodyne phase. Unfortunately, such filtering techniques cannot be 
used to eliminate cyclic error effects at low slew rates of the stage (including, e.g., zero speed 
of the stage) when the corresponding Doppler shift frequencies cannot be distinguished from 
the frequency of the primary signal. Further complications arise with cyclic error frequencies 

15 are within the bandwidth of the servo system, in which case the cyclic errors can be coupled 
directly into the stage position through the servo control system, and even amplify the error 
in the stage position from a desired position. 

Specific details of a preferred embodiment of the CEC are described further below. 
Advantages of that embodiment include a cyclic error correction signal that may be 

20 generated in a "feed forward mode," where the feed forward mode can involve a simple 

digital transform based on a translation in time and need not require a spectral analysis or the 
use of a digital transform such as a digital Fourier transform, such as a fast Fourier transform 
(FFT). Likewise, conjugated quadratures of the main interference signal and the reference 
signal can be generated by simple digital transforms and need not require the use of a digital 

25 transform such as a digital Hilbert transform. Moreover, the feed forward mode can reduce 
the number of computer logic operations that are required at the very highest compute rates 
and can thereby reduce errors in data age that are introduced by incorporation of CEC. 

Another advantage is that the cyclic error coefficients can be characterized at Doppler 
shift frequencies for which the phase meter cannot distinguish between the cyclic error 

30 frequencies from the frequency of the primary component of the interference signal. 

Furthermore, the cyclic error coefficients can be characterized and used for compensation 
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over a range of Doppler shift frequencies that is small relative to heterodyne frequency, 
which is a range over which the cyclic error coefficients are typically frequency independent, 
thereby simplifying the cyclic error correction. 

Prior to describing a preferred embodiment of the CEC, it is useful to classify cyclic 
errors according to properties with respect to the cyclic error amplitudes. Three 
classifications are set out with reference to a standard high stability plane mirror 
interferometer (HSPMI). They are: Type 1 - Constant amplitude cyclic errors; Type 2 - 
Variable amplitude cyclic errors; and Type 3 - Intermittent cyclic errors. 

The amplitudes of Type 1 cyclic errors are independent of the orientation of a plane 
mirror measurement object of a plane mirror interferometer. 

The amplitudes of Type 2 cyclic errors are dependent on the orientation of the plane 
mirror measurement object with a relative variability similar to that experienced by the 
amplitude of the primary component of the respective electrical interference signal, e.g., <± 
20 %. 

The amplitudes of Type 3 cyclic errors are nominally zero except when the reflecting 
surface of the plane mirror measurement object is parallel to within * 50 urad of a conjugate 
image of a reflecting or partially reflecting surface, e.g., the reference mirror, of the 
interferometer. 

Examples of Type 1 cyclic errors are the harmonic cyclic errors generated by 
polarization mixing in the source of an input beam to the heterodyne interferometer, 
polarization mixing that is produced by a polarization beam-splitter having finite extinction 
ratios that splits the input beam into reference and measurement beams, and polarization 
mixing that is produced by a silver coated cube corner retroreflector such as commonly used 
in a high stability plane mirror interferometer (HSPMI). The amplitudes of Type 1 cyclic 
errors are typically < 0.25 run using techniques such as described in U.S. Patent Application 

Serial No. 10/174,149 entitled "Interferometry System and Method Employing an Angular 
Difference in Propagation Between Orthogonally Polarized Input Beam Components" to 
Peter de Groot and Henry A. Hill and filed July 17, 2002, in U.S. Patent No. 6,201,609 Bl 
entitled "Interferometers Utilizing Polarization Preserving Optical Systems" to Henry A. Hill, 
and in U.S. Patent No. 6,163,379 entitled "Interferometer with Tilted Waveplates for 
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Reducing Ghost Reflections" to Peter de Groot, the contents of which is incorporated herein 
by reference. 

Examples of Type 2 cyclic errors are the harmonic cyclic errors generated by 
spurious beams due to unwanted reflections by certain surfaces. The typical amplitude of a 
5 Type 2 cyclic error is 0.06 nm for a respective surface having a reflectivity of 0.0025. The 
amplitude of a particular Type 2 cyclic error will typically vary by < ± 20 % as the stage 

mirror is scanned through a range of for example ± 500 prad in orientation with a physical 

separation of « 0.7 m between the polarizing beam-splitter of an interferometer and plane 

mirror measurement object. 

10 Examples of Type 3 intermittent cyclic errors are sub-harmonic cyclic errors that 

have relatively large amplitudes, e.g. 2 nm, when the conditions for Type 3 cyclic errors are 
met. The Type 3 cyclic errors can be eliminated or reduced to below 0.025 nm (3a) by 
optical techniques where elements of the interferometer are rotated or tilted to eliminate or 
reduce certain cyclic non-linear errors such as described in U.S. Patent Application 

15 Publication No. 2003/0038947 entitled "Tilted Interferometer" to Henry A. Hill, the contents 
of which are herein incorporated in its entirety by reference. The elimination of Type 3 
cyclic errors by the optical techniques considerably reduces the task left for CEC in 
achieving a compensated cyclic error residual of 0.025 nm (3o-) or less. Of course, in further 
embodiments of the invention, sub-harmonic cyclic errors (such as the Type 3 half-cycle 

20 error) may also be compensated, as described further below. 

The variability of Type 2 cyclic errors expressed in terms of a displacement is 
typically < ± 0.010 nm in amplitude. As a consequence, Type 2 cyclic errors can be treated 

as cyclic errors having constant amplitudes with constant offset phases in implementation of 
CEC that has a compensated cyclic error budget of 0.01 nm (3a) or less. 

25 The number of cyclic error terms that need to be compensated when the Type 3 cyclic 

errors are eliminated or reduced by the optical techniques are low, e.g. of the order of 3, for a 
compensated cyclic error budget of 0.05 nm (3a) or less. A particular cyclic error term after 
elimination of Type 3 cyclic errors may comprise one or more Type 1 and/or Type 2 cyclic 
errors. Nonetheless, in further embodiments, the CEC may also be used to compensate Type 

30 3 cyclic errors. 
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Additional material describing quantifying and correcting cyclic errors are described 
in commonly owned U.S. Patent No. 6,252,668, U.S. Patent No. 6,246,481, U.S. Patent No. 
6,137,574, U.S. Patent Application Publicaiton No. 2002/0089671, and U.S. Patent 
Application Serial No. 10/287,898 entitled "Interferometric Cyclic Error Compensation" 
5 filed November 5, 2002 by Henry A. Hill, the entire contents each of which are incorporated 
herein by reference. 

We now described a preferred embodiment for the CEC that generates in a feed 
forward mode of operation, in which a cyclic error correction signal Sy (t) is subtracted 

from a corresponding electrical interference signal S[t) of an interferometer to produce a 
10 compensated electrical interference signal. The phase of the compensated electrical 
interference signal is then measured by a phase meter to extract relative path length 
information associated with the particular interferometer arrangement. Because cyclic error 
effects have been reduced, the relative path length information is more accurate. As a result, 
the compensated electrical interference phase can be used to measure and control through a 
15 servo control system the position of a stage, even at low slew rates including a zero slew rate 
where cyclic error effects can otherwise be especially problematic. 

Referring to FIGS, la and lb, in a preferred embodiment, the CEC comprises two 
processing units. One processing unit 10 determines cyclic error basis functions and factors 
relating to the amplitudes and offset phases of cyclic errors that need be compensated. A 
20 second processing unit 60 of CEC generates cyclic error correction signal (r) using the 

cyclic error basis functions and the factors relating to the amplitudes and offset phases 
determined by first processing unit 10. The first processing unit 10 of CEC for the first 
embodiment is shown schematically in Fig. la and the second processing unit 60 of CEC of 
the first embodiment is shown schematically in Fig. lb . 
25 Referring now to FIG,, la, an optical signal 11 from an interferometer is detected by 

detector 12 to generate an electrical interference signal. The electrical interference signal is 
converted to digital format by an analog to digital converter (ADC) in converter/filter 52 as 
electrical interference signal S(t) and sent to the CEC processor. For example, the ADC 
conversion rate is a high rate, e.g., 120 MHz. 
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In the present embodiment, we focus on a particular set of four cyclic error terms are 

compensated at low slew rates. Adaptation to compensate for a different set of cyclic errors 

will be evident to one skilled in the art based on the subsequent description. The electrical 

interference signal S(t) comprising the four cyclic error terms can be expressed in the form 



S(t) = 4 cos(<p R + 9 + ) + Sy (t) (1) 
where 

(0 = (0 + 5 v0 (0 + S V2 (0 + s v3 (0 ; (2) 

5 , M ,_i(0 = e_icos((p^ -<p + S-i) . ( 3 ) 

-V) = e O cos (<P*+Co) > ( 4 ) 

^2 =8 2 cos (^+29 + ^2) > (5) 

^3 =E 3 C0S ( ( P^ +3( P + ^3) 5 ( 6 ) 



cp^ is the phase of a reference signal with d(p R /dt = (o R corresponding to 2k times 

the frequency difference of the measurement beam and reference beam components of the 
input beam to the interferometer; A\ and are the amplitude and offset phase, respectively, 
of the primary component of the electrical interference signal; e_i , e 0 , e 2 > anc * e 3 are the 
amplitudes for the cyclic error terms; , £ 0 , £ 2 » ^ ^3 ^ me offset phases of the cyclic 
error terms; 

9 = AkL (7) 
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for a plane mirror interferometer such as a HSPMI (which involves two passes of the 
measurement beam to the measurement object); k is a wavenumber corresponding to 
wavelength X of beam 10; and L is the one way physical length of the measurement path 
relative to the one way physical length of the reference path of the interferometer. Cyclic 
5 error amplitudes s_i , s 0 , e 2 , and e 3 are much less than the A l , i.e. < (1/50) A x . An 
example of the frequency difference a> R /2n is 20 MHz. 

Note that there is generally a set of cyclic error terms whose phases are independent 
of <p R . This set of cyclic error terms has been omitted from Equation (2) because they are 
eliminated by a high pass filter in converter/filter 52. 
10 The factors relating to amplitudes e p and offset phases C, p of the cyclic error terms 

and the time dependent factors of the cyclic error terms are generated using measured values 
of both S(t) and reference signal S R {t) . The factors relating to amplitudes e p and offset 

phases C, p are determined and the results transmitted to a table 40 for subsequent use in 
generation of the cyclic error correction signal Sy (/) . The time dependent factors of the 

1 5 cyclic error terms are obtained by application of simple digital transforms based on 
trigonometric identities and properties of conjugated quadratures of signals. 

Optical reference signal 13 is detected by detector 14 to produce an electrical 
reference signal. The optical reference signal can be derived from a portion of the input 
beam to the interferometer. Alternatively, the electrical reference signal can be derived 

20 directly from the source that introduces the heterodyne frequency splitting in the input beam 
components (e.g., from the drive signal to an acousto-optical modulator used to generate the 
heterodyne frequency splitting). The electrical reference signal is converted to a digital 
format and passed through a high pass filter in con verter/fi Iter 54 to produce reference signal 
S R (t) . Reference signal S R (t) in digital format is written as 



S R =A R cos^Hfl) 



(8) 
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where A R and C, R are the amplitude and offset phase, respectively, of the reference signal. 

The ADC conversion rate in 54 for S R (t) is the same as the ADC conversion rate in 52 for 

S(t) . The quadrature signal S R (t) of S R (t) written as 

5 S R (t) = A R sm(<p R +S R ) (9) 

is generated by electronic processing using measured values of S R (t) according to the 
formula 

10 S R (0 = (cot g^x) S R (t - 2x) _ cos2g >R x Sr (t-x) (10) 

sinco^x 

where 1/x is the ADC conversion rate of reference signal S R (t) in 54 . For the example of a 
frequency difference for o) R /2n = 20 MHz and an ADC conversion rate 1/x of 120 MHz, 
Equation (10) reduces to a particularly simple form 

15 

S R {t) = -L[S R (t-x) + S R (t-2x)] . (11) 

Reference signal S R (t) and quadrature signal S R (t) are conjugated quadratures of 
reference signal S R (t) . The quadrature signal S R (t) is generated by processor 16 using 
20 Equation (1 1) or Equation (10) as appropriate. - 

The quadrature signal S(t) of S(t) is generated by processor 56 using the same 
processing procedure as that described for the generation of quadrature signal S R (t) . 
Accordingly, for the example of a ratio of 1/x and co^/27t equal to 6, 
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S(t)=-j=[S(t-T) + S(t-2x)] 



(12) 



where 



+ E2 sin {<PR + 2 <P + S 2 ) + e 3 sin (<?R + 3( P + ^3 ) • 

Equation (12) is valid when the stage slew rate is low, e.g., when (p changes 
insignificantly over the time period 2r . Signal S(J) and quadrature signal S(t) are 
conjugated quadratures of signal S(t) . 

Notably, the integral relationship between 1/x and (o R /2n allows the generation of 
feed forward values S'(t) and S'(0 of S(t) and S(t) , respectively, and S' R (t) and S' R (t) of 
S R (t) and 5^(0 , respectively, are according to the formulae 



S'(t) = S(t-6mx), 
S'(t) = S(t-6mx) , 

S R (t) = S R (t-6mx) 



(14) 



(15) 



(16) 
(17) 



where m is an integer such that the error in the phases of feed forward signals with respect to 
corresponding phases of signals is less than predetermined values set by an end use 
application. In other words, prior values of the main interference signal and the reference 
signal can be used to generate the quadrature signals and subsequent error basis functions. In 
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other embodiments in which the ratio between 1/x and w R /2n is an integer different from 6, 
Equations (14)-(17) are modified accordingly. 

Using algebraic combinations of the signals S(t) , S(t) , S R (t) , and S R (t) , 
processing unit 10 generates cyclic error basis functions, which are sine and cosine functions 
that have the same time-varying arguments as the cyclic error terms, and then uses the cyclic 
error basis functions to project out respective cyclic error coefficients from S(t) and S(t) by 
low-pas filtering (e.g., averaging). The cyclic error basis functions for Sy 0 (t) , for example, 
are especially simple and correspond to the reference signal and its quadrature signal S R (t), 
and S R (t) . In other words, to process the signals for information about the cyclic error term 
80 cos(cpfl + C,o ) , signals S R (t) and S R (t) are used as time dependent factors in the 
representation of the cyclic error term s 0 cos(cp^ + So ) ■ 

To better understand the representation, it is beneficial to rewrite cyclic error term 
6 0 cos (cp# + So ) in terms of the time dependent functions cos (cp^ + S# ) and sin ((p R +C >R ) 
with the results 



e 0 cos((p/?+; 0 ) = eo 



cos(Co-5*)cos(<|>rt+^) 
_-sin(S 0 -S/?) sin (<P/?+C/?)J 



(18) 



e 0 sin((p^ + So) = £ o 



cos(S 0 -S/?) sin (<Ptf + O?) 
+ sin(Co-C/ ? ) cos ( ( P* 



(19) 



Equations (18) and (19) can be rewritten as 

e 0 cos(<? R + So ) = [Aq cos(q> R +t; R )-B 0 sin (y R + C, R )] 

eo sin (op* + So ) = [A) sin (q> R + S* ) + B 0 cos (<p R + C, R )] 



(20) 



(21) 
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where 



Aq =e 0 cos (; 0 -<;,?) , 
B 0 =E 0 sin(i; 0 -C, R ) . 
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(22) 



(23) 



Conjugated quadratures S(t) and S(t) and conjugated quadratures S R (t) and 
S R (t) are transmitted to processor 20 wherein signals E 0 (/) and S 0 (/) are generated. 
Signals S 0 (0 m & ^0 (0 816 given by the equations 



(24) 



so(0— 5(0^(0+^(0^(0 



(25) 



Using properties of conjugated quadratures and certain trigonometric identities, e.g., 



So (0 = A rA) 

A x cos (<p + ^ - C, R ) + e_! cos (-9 + <;_! - C, R ) 
+s 2 cos (2cp + C 2 - ) + e 3 cos (3q> + ; 3 - ^ ) 



(26) 



±o(t) = A R B 



A sin (<P + Si ~ £>R ) + e -l sin ("<P + Cl - £>R ) 
+e 2 sin(2cp + ^2 " S/? ) + e 3 sin(3cp + ^ 3 ) 



(27) 



Notably, the generation of signals S 0 (0 an d (0 project the coefficients associated with 
SyoCO to zer0 frequency, where low-pass filtering techniques can be used to determine 
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them. Thus, signals £ 0 (/) and £ 0 (/) are transmitted to low pass digital filters in processor 
24, e.g., low pass Butterworth filters, where coefficients A r Aq and A r Bq are determined. 

For a Butterworth filter T„ (x) of order n , the corresponding outputs of the low pass 
digital filters for inputs £q(0 m & ^o(0 xe 



= A r Aq 



+e 2 0 



(-e 3 0 



3©r 



(28) 



40^ 



2(0/ 



(29) 



where O(x) denotes a term of the order of x , co c is the - 3 dB angular cutoff frequency, and 
(Op = dy/dt . 

The terms on the right hand sides of Equations (28) and (29) with factors A R A l are 
the sources of the largest errors and accordingly determine the specifications of n and the 
minimum ratio for co^ /co c that can be used when the outputs of processor 24 are stored in 
table 40. For a fourth order Butterworth filter, i.e., n = 4, and a minimum ratio 
fora>£>/a> c = 7 , the error terms on the right hand side of Equations (28) and (29) will 
generate errors that correspond to < 0.010 nm (3 a ). When the stage is moving at a speed 
such that the corresponding Doppler shift frequency a D /2n is 10 to 100 times greater than 
the bandwidth of the stage servo control system and the requirement with respect to g> d /co c 

26 
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is satisfied, the outputs A r Aq and A r Bq of the low pass filters in processor 24 are stored in 

table 40 and in processors 26 and 28 under the control of signal 72. 

Notably, in this preferred embodiment, co D can vary by factors such as 2 or more 

during the period associated with output values of A r Aq and A R B 0 that are stored in table 

40. 

Quadratures S R and S R are transmitted to processor 22 and the value for A R 2 is 
determined in processor 22 by a procedure similar to that used in processor 20 and processor 
24 using the formulae 

T n [s R (t)-S R {t) + S R (t)S R (t)] = A R 2 . (30) 

The value of the order n need only be for example 2. Values of A R are 
transmitted to table 40 and stored under the control of signal 72. 

The values for A r Aq , A R B 0 , S(t), S(t), and A R are transmitted to processor 26 
and the values of A r Aq , A R B Q , and A R are stored in processor 26 under the control of 
signal 72 for the generation of conjugated quadratures Sj (?) and S\{t) where 



A R a r 

= A x cos (<p R + cp + ^ ) + e_i cos((p R - (p + C-i) (31) 
+ s 2 cos(cp^ +2<p + C, 2 ) + £3 cos((p /? +3(p + <; 3 ), 



a r a r 

= A sin (<P/? + 9 + ) + e-i sin (y R - cp + d ) (32) 
+ 828^(9^ +2(p + ^ 2 ) + £3sin((p / j +3(p + ^ 3 ). 
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The values for A r Aq , A R B 0 , S 0 (0 > t Q (t) are transmitted to processor 28 and 
the values of A r Aq and A R B 0 are stored in processor 28 under the control of signal 72 for 
the generation of conjugated quadratures Ej(0 and t^t) where 



Zi(') b2 o(0-4i4> 

\A x cos(cp + + s_! cos(-(p + C-i - C/? ) 1 

+s 2 cos(2(p + ; 2 - C /? ) + £3 C0S ( 3( P + ^3 - C/f ) J 



(33) 



Z, (/)-Z 0 (f)-^ 

("^ sin((p + ^ - ^ ) + e_j sin(-q> + - ^ ) 

= /i^ 

|_+e 2 sin(2cp + <; 2 - ^ ) + e 3 sin(3cp + <; 3 - ^ ) 



(34) 



Signals S R (t) , S R (t) , ^(t) , and t^(t) are transmitted to processor 30 for the 
generation of conjugated quadratures 2_i(0 and 2_i(/) where 



S_ 1 (0-LiS je +2 1 S il 

E_ 1 cos((p i? +(p-C_ 1 +2; /? ) 
= y4 2 +4 cos ( ( P«-9-<;i+2^) 
+e 2 cos((p^-2(p-; 2 +2^) 
_+e 3 cos(9^-3(p-^3+2^) 



(35) 



e_] sin (<p R + 9 -<;_! + 2^) 
= ^ 2 +^isin((p/? -cp-^i +2^) 
+s 2 sin((p^ -2q>-C 2 +2^) 
_+s 3 sin((p w -3q>-^ 3 + 2^) 



(36) 
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Signals I_i(?) and Z_i (0 correspond to the cyclic errror basis functions for 
i n that the leading terms of £_j (?) and I_j (?) are sinusoids with the same time- 
varying argument as that of Sy_i(t) . 

Coefficients A^ A X A_ X and -A R 2 A { B_i are next determined through digital low 
5 pass filters, e.g., low pass Butterworth filters, in processor 32 where 



The input signals for the digital filters are 2 4 (?) and E 4 (?) . Input signals 2 4 (?) 
and E 4 (?) are generated in processor 32 using signals S^, S\, I_j (?) , and E_j (?) 
according to the formulae 



4_,se_j cos -2^), 



(37) 



5_, S E_ 1 sin(C_ 1 +Ci-2^). 



(38) 



10 



15 




(39) 



M0=I>(0M'M(0M0] 



(40) 



Equations (39) and (40) are written in terms of A_ x and B_ x using Equations (37) 



20 



and (38) as 



r^cos(-29-; 1 -^ 1 +2^ j f) 
+ V4 +2e 2 cos(-39-^-^2 + 2^) 
_+2s 3 cos(-4q>-^-^ 3 +2^) 



(41) 
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^sin(-2<p-^-^+%) 
+2s 2 sin(-3q>-; 1 -; 2 +2^) 
+2e 3 sin(-4<p-^-^3+2^)_ 



+ V4 



(42) 



Signals £ 4 (f) and E 4 (f ) are sent to low pass digital filters in processor 32, e.g., low 
pass Butterworth filters, where coefficients 2A R A\A_ X and -2A R A\B_ X are determined. 
For a Butterworth filter T n (x) of order n , the corresponding outputs of the low pass digital 
filters for inputs E 4 (t) and E 4 (t) are 



r n [E 4 (/)] = 2V¥-i 



40 



2©r 



+ 2e 2 0 



+ 2e 3 0 



(43) 



r„[£ 4 (0] = -2V42L: 



(44) 



The terms on the right hand sides of Equations (43) and (44) with factors A R A 2 are 
the sources of the largest errors and accordingly determine the specifications of n and the 
minimum ratio for (o D /(o c that can be used when the outputs of processor 32 are stored in 
table 40. For a fourth order Butterworth filter ,i.e., n = 4, and a minimum ratio for 
co D /(o c = 3.5 , the error terms on the right hand side of Equations (43) and (44) will generate 

errors that correspond to £ 0.010 ran (3 a). The outputs 2A R 2 A\A_ X and -2A R A\B_\ of 
low pass filters of processor 32 are divided by 2 to generate A R 2 A\A_\ and -A R A\B_\ as 
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the outputs of processor 32. When the stage is moving at a speed such that the corresponding 
Doppler shift frequency & d /2ti is 10 to 100 times greater than the bandwidth of the stage 
servo control system and the requirement with respect to a>£>/co c is satisfied, the outputs 

ArA\A-\ md ~ a r' a \ b -\ of processor 32 are stored in table 40 and in processor 34 under 
the control of signal 72. 

Signals S\ (t) and 5j (/) are transmitted to processor 30 for the generation of 
conjugated quadratures 2 2 (0 md ^2(0 where 

S 2 (/)^5 r S 1 -5 r S 1 

" A l cos{y R +2q> + 2(; 1 -(; R ) 
+2e_ lC os(<p R 

= Af, A\ 

+2s 2 cos(<p R + 3cp + ^ + <; 2 - ) 
_+2e 3 cos^ + 4q> + + C, 3 - t; R ) 
+ A R 0(e i e j ), 

L 2 (t) = 5] • Ej + 5j ■ Ej 

"^ 1 sin((p^+2(p + 2<; 1 -(;^) 
+2s_! sin (cp* -<;*) 
+2s 2 sm^fl + 3(p + ^ + £ 2 - C/? ) 
_+2s 3 sin((p^ + 4(p + Ci + C3 " C/? ). 
+ ^0(e z £ ; ). 

Signals £ 2 (/) and 2 2 (0 correspond to the cyclic errror basis functions for iS\y 2 (0 in 
that the leading terms of Z 2 (0 and S 2 (0 are sinusoids with the same time- varying argument 
as that of 5^2(0 ■ 

Signals S u S h 7^(t), ^(t), S 2 (0,and ^ 2 (0 and coefficients A R A\ A_ h and are 
-A R A\B_ X transmitted to processor 34 and coefficients A R A X A_ X , and -A R 2 A\B_\ stored in 



(45) 



(46) 



31 



Attorney Docket No. 09712-330001 
Client Reference No. Z-429 

processor 34 under the control of signal 72 for generation of conjugated quadratures I 3 (0 

and E 3 0) where 



I 3 (i).Z, -S 2 -Z r Z 2 -3[V^4-,S, - V^-A] 
3e_j cosftp/? + cp + 2d + - 2^ ) 
= Ar 2 a 2 +4cos((p s +3(p + 2; 1 +Q-2; /? ) 
* 1 +3e 2 cos((p /? +4(p + 2^1+^2 -2C, R ) 
+3s 3 cos(cp^ + 5<p + + £ 3 - 2^ ) _ 

- 3[ V^l^i^l " V^-i^i ] + A R 2 A X 0(e^ ) + . . 



(47) 



S 3 (0 = W 



4cos(cp^ +3cp + 3Ci -2C/?) 
+3e 2 costq)^ + 4cp + 2^i + ^ 2 ~ 2 C /? ) 
+3e 3 cos(<p* + 5<p + 2d +C3 ~ 2 Cj?) 



(48) 



S 3 (/)-£! .£ 2 +2\ -Z 2 -3[W-l5i - V^-l-Sl] 

" 3e_! sin (9* + 9 + 2^ + d - 2C, R )' 

= A 2 A 2 +v4 l sin(( P^ + 3( P + 2 ^ + ^ -2 ^ ) 
R 1 +3s 2 sin(q> /? +49 + 2^+^2-2^) 
_+3e 3 sin(9^+ 59 + 2^+^-2^) _ 



(49) 



-3[A R 2 A l A. l S l + A R 2 A l B_ l S i yA R 2 A l 0(e i E j )+.. 



t 3 (t) = A R 2 Ai 



A\sm((p R +39 + 3^ -2^ R ) 
+3e 2 sin(9^ + 49 + 2^ + C 2 - 2C, R ) 
+3e 3 sin(9yj + 59 + 2^ + £ 3 - 2^? ) . 



+A R 2 A 1 0(e i e j )+.. 



(50) 
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Signals £3(0 and £3(0 correspond to the cyclic errror basis functions for £^3(0 in 
that the leading terms of Z 3 (f) and Z 3 (0 are sinusoids with the same time-varying argument 
as that of S^iO- 

Coefficient! 

coefficients for and are given explicitly by Equations (67) and (68), respectively, further 
below) are next determined through digital low pass filters, e.g., low pass Butterworth filters, 
in processor 38. The input signals for the digital filters are Z 5 (/) and I5 (t) , respectively. 

The input signals are generated in processor 38 using signals Si, S\, 2 2 (') > an< ^ ^2 (0 
according to the formulae 



M0-[*i(0M0+$MM0] . 



(51) 



s 5 (0-[Mf)M0-$W £ 2W] 



(52) 



The expansions of Z 5 (r) and t 5 (/) , given by Equations (51) and (52), respectively, 
in terms of cyclic error and non-cyclic error terms are 



H 5 {t) = A R A l 2 A 2 



s-l [2 cos (-9 + C_i - C/? ) + cos(3cp + 2^ - 5_! - )] 
+^cos((p + ^-^) 
+2s 2 cos(2( P + <; 2 -C/ ? ) 

+e 3 [2 cos (3cp + C 3 - C, R ) + cos (-q> + 2^ - C 3 - )] 



(53) 



33 



Attorney Docket No. 09712-330001 
Client Reference No. Z-429 



t 5 (t) = -A R A x 2 B 2 

e_! [2 sin (-9 + - ^ ) + sin(3(p + 2^ - - ^ )] 

+4 sin (cp +<;!-<;/?) 

+2e 2 sin (2cp + <;2 -<;/?) 

+63 [2 sin (3(p + C 3 - ) + sin (-cp + 2^ - ^ - C, R )] 
+ A R A l o(e 2 ) + ... , 



VA R A{ 



(54) 



where A 2 and i? 2 ^ given by Equations (67) and (68), respectively, shown further below. 

Signals E 5 (t) and E 5 (/) are sent to low pass digital filters in processor 38, e.g., low 
pass Butterworth filters, where coefficients A R A 2 A 2 and -A R A 2 B 2 are determined. For a 
Butterworth filter T n (x) of order n , the corresponding outputs of the low pass digital filters 
for inputs I 5 (/) and t 5 (t) are 



+ A R A l 2 



+2s 2 0i 



+26_,0 



+ 2s 3 0 



+ e_i 0 



{3w 
f 83 0i 



0) D 



(55) 



T n [Z 5 (t)] = -A R A 2 B 2 
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The terms on the right hand sides of Equations (55) and (56) with factors A R A\ are 
the sources of the largest errors and accordingly determine the specifications of n and the 
minimum ratio for co D /co c that can be used when the outputs of processor 38 are stored in 
table 40. For a fourth order Butterworth filter , i.e., n = 4 , and a minimum ratio 
fora)£,/co c = 7 , the error terms on the right hand side of Equations (55) and (56) will 

generate errors that correspond to ^ 0.010 nm (3 a). The outputs A R A\ Ai and -A R A\B2 
of low pass filters of processor 38 are the outputs of processor 38. When the stage is moving 
at a speed such that the corresponding Doppler shift frequency a> D /2n is 10 to 100 times 
greater than the bandwidth of the stage servo control system and the requirement with respect 
to cfl£>/(o c is satisfied, the outputs A R A\ A-± and -A R A\B2 of processor 38 are stored in 
table 40 under the control of signal 72. 

Coefficients A R A\ 'A$ and -Aj^A^B^ (where A-^ and 5 3 are cyclic error 
coefficients for and ar e given explicitly by Equations (69) and (70), respectively, further 
below) are next determined through a digital low pass filter, e.g., a low pass Butterworth 
filter, in processor 36. The input signals for the digital filters are Z 6 (() and S 6 (t) , 

respectively. The input signals are generated in processor 36 using signals Si, -Sj, ^(t), 
and 2 3 (t) according to the formulae 



The expansions of Z 6 (t) and t 6 (t) given by Equations (57) and (58), respectively, 
in terms of cyclic error and non-cyclic error terms are 




(57) 




(58) 
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e_ 1 cos(4c P -<;_ 1 +3(; 1 -2(;^) 
+4cos(2q>-?i+3Ci -2Cjj) 

+e 2 cos (<p - £2 + 3£i - 2£a ) + 3s 2 cos (3cp + Ci + " 2 C 7? ) 
+3£ 3 cos(4(p+^+^3-2^) 



(59) 



z 6 (0=-Va 3 5 3 

"s_ 1 sin(4( P -;_ 1 +3Ci-2^) 

+VV + ^ sin ( 2< p-^ +3 ^- 2 ^) 

* 1 +s 2 sin(( P -C2+3^-2^) + 3E2sin(3(p + ; 1 +(; 2 -2^) 
_+3e 3 sin (49+^+^3-^) 



(60) 



where A$ and 2? 3 are given by Equations (69) and (70), respectively. 

Signals S 6 (/) and E 6 (7) are sent to low pass digital filters in processor 36, e.g., low 
pass Butterworth filters, where coefficients A R 2 A^A^ and -Aj^A^B^ are determined. For 
a Butterworth filter T n (x) of order n , the corresponding outputs of the low pass digital 
filters for inputs 2 6 (t) and Z 6 (/) are 



T n [z 6 {tj\ = A R 2 A?A : 
AO 



+e 2 0 +3s 2 0i 



f3s,0 



(61) 
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r„[z 6 (/)] = - 

40 







2co^ 

















(62) 



The terms on the right hand sides of Equations (61) and (62) with factors A R A\ are 
the sources of the largest errors and accordingly determined the specifications of n and the 
minimum ratio for a> D /<i) c that can be used when the outputs of processor 36 are stored in 
table 40. For a fourth order Butterworth filter ,Le., n = 4, and a minimum ratio 
forces /co c = 3.5 , the error terms on the right hand side of Equations (61) and (62) will 

generate errors that correspond to < 0.010 nm (3 a ). The outputs A R 2 A^A 3 and 

-A^A^B^ of low pass filters of processor 36 are the outputs of processor 36. When the 
stage is moving at a speed such that the corresponding Doppler shift frequency (o D /2n is 10 
to 100 times greater than the bandwidth of the stage servo control system and the 
requirement with respect to ®d/(£> c is satisfied, the outputs A^A^A^ and -A^A^B-^ of 
processor 36 are stored in table 40 under the control of signal 72. 

Finally, quadratures S and S are transmitted from processors 52 and 56 respectively 
to processor 18 for the purpose of determining a value for A\ . First, a signal 
S(t)S(t) + S(t)S(t) is generated where 
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S(t)S(t) + S(t)S(t) = V +[s_i 2 +£o 2 +£ 2 2 +£3 2 ] 



+24E-1 005(29+1!;! 
+2^6 0 cos(<p + ^-; 0 ) 

+2Afii cos(-2(p + ^j -^3) 
+0(8,8,) . 



(63) 



The signal of Equation (63) is sent to a low pass digital filter in processor 18, e.g., a 
low pass Butterworth filter, where the coefficient A? is determined. For a Butterworth filter 
T n (x) of order n , the corresponding outputs of the low pass digital filter for the signal of 
Equation (63) is 



The accuracy required for the determination of A± is approximately 0.5% in order to 
limit errors generated in the computation of cyclic error signals S^j to < 0.010 ran (3 a ). 

Therefore the error terms s_ } 2 , e 0 2 , z-i , and e 2 2 on the right hand side of Equation (64) are 



negligible. The terms on the right hand side of Equation (64) of the form O — — are the 



sources of the largest Doppler shift frequency dependent errors and accordingly determine 
the specifications of n and the minimum ratio for (£> D /(d c that can be used when the output 
of processor 18 is stored in table 40. For a second order Butterworth filter , i.e., n = 2 , and a 
minimum ratio for ffl£>/a> c = 3.5 , the Doppler shift frequency dependent error terms on the 
right hand side of Equation (64) will generate errors that correspond to < 0.010 nm (3 a ). 



r n [s(0-s«+*M-s«]=4 2 +[ { 





(64) 
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The output A\ of the low pass filter of processor 18 is the output of processor 18. When the 
stage is moving at a speed such that the corresponding Doppler shift frequency a D /2n is 10 
to 100 times greater than the bandwidth of the stage servo control system and the 
requirement with respect to (o D /(o £ . is satisfied, the output A\ of processor 18 is stored in 
table 40 under the control of signal 72. 

Referring now to FIG. lb, processor 60 generates the compensating error signal . 

With respect to generating signal Sy , it is beneficial to rewrite the e_\ , s 0 , s 2 > an ^ 8 3 
cyclic error terms of Sy in terms of the highest order time dependent terms of Z_j {t) , 

M')> S R> Sr> M0' M')> M'W cos(<p* -(p-*;, +2^), 

sin((p/? -(p-^l + Kr), cos(y R + C, R ), sm(q> R + C, R ), cos (cp^ +2q> + 2^ -C, R ), 
s'm{<p R +2(p + 2^j -C,r), cos((p^ +3(p + 3^ -2C, R ), and sin^/j +3(p + 3^ -2t )R ) as 



s (A= e - lCOS ^ 1+ ^-i~ 2 ^) cos ( (p ^"" (f)_ ^ +2 ^) 

V |_- e -l sin ftl + " 2C/t ) sin (q>^ - q> - ^ ! + 2^ ) 
S0 cos fo-^) cos ( ( P/? + ^) 

|_-s 0 sin (Co - ^ )sin (<p^ + C, R ) 

E 2 cos (-2^ + <; 2 + )cos (q>/j + 2<p + 2^ - ^ ) 
|_-6 2 sin (-2^1 +C 2 +^) sin ((p^ +2<p + 2^ 
+ 1" s 3 cos (-3^ + ; 3 + 2^ ) cos (<p R + 3cp + 3^ - 2^ )" 
[-s 3 sin (-3^ + ; 3 + 2^ )sin (9^ + 3cp + 3^ - 2^ R ) 

Equation (65) for is next written in the form 



5 V)/ (r) = [^_ 1 cos((p /? -9-^1 +2(; R )-B_ ] sin(<p R -(p-Q +2^)] 
+ [ ^ocos((p^ +C/?)-5 0 sin((p^ +C,r)] 
+ [A 2 cos (9^ + 2<p + 2^i - Cj? ) " #2 sin (<P* + 29 + - C/j )] 
+ [A 3 cos (9^ + 39 + 3Ci - 2^ ) - 5 3 sin (9* + 39 + 3^ - 2^ )] 
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where A_\ , B_i, Aq, and B 0 are given by equations(37), (38), (22), and (23), 
respectively, and 

A 2 =£ 2 cos(-2<; 1 +C2+C/?) , (67) 



B 2 =s 2 sin(-2; 1 +C, 2 +t; R ), 



(68) 



^ 3 =s 3 cos(-3; 1 +<;3+2<; /? ), 

5 3 =s 3 sin(-3(; 1 +C3+2C/?) • 



(69) 
(70) 



Compensation error signal is generated in processor 44 using Equation (66), the 
coefficients transmitted from table 40 as signal 42, and the signals Z_j (t) , E_i (t) , S R , S R , 
15 Z 2 (0 > ^2 (0 ' ^3 (0' m & ^3 (0 (which comprise the cyclic error basis functions). 
Explicitly, 



M0 S 



A R A l A -l 



(71) 



V(V) 2 J" 2 'IV(V) 2 



(V) 2 <4 2 ) 3 J 3 U 2 ) 2 (V) 3 1 



20 



In other words, the compensation error signal is generated form a superposition of the error 
basis functions weighted by the cyclic error coefficients. 
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The compensating signal is subtracted from signal S in processor 46 under 
control of signal 76 to generate compensated signal S-Sy. Control signal 76 determines 
when signal S is to be compensated. The phase (p = AkL is then extracted from the 
compensated signal with a subsequent processor (not shown) to, for example, provide a more 
accurate measurement of the distance L. 

In the presently preferred embodiment, the error compensation signal Sy (t) is 

derived from prior values of the signals S(t), S(t), S R {t) and S R (t). For example, 
feedforward signals S'(t)> S'r(0 and S' R (t) (as described in Equations (14), (15), 
(16), and (17)) may replace signals S(t), S(t), S R (t) and S R (t) in the calculation of the 
cyclic error coefficients, the cyclic error basis functions, and the error compensation signal 
based on them. Furthemore, feedforward values for S'(t) and S' R (t) may replace S(t) and 

S R (t) in the calculation of quadrature signals S(t) and S R (t) according to Equations (1 1) 
and (12). As a result, the compensation of the signal S(t) in processor 46 can proceed at a 
high data rate because the error compensation signal (t) will have already been generated 
when processor 46 receives the real-time values of signal S(t) and the only real-time 
operation is the subtraction of the error compensation signal (/) from the main signal 
S(t). This feature is especially useful in the context of an interferometric stage system 
operating under servo control because the high data rate of the cyclic error compensation 
introduces negligible data age to the servo position information derived from the 
compensated signal. 

In further embodiments, the cyclic error coefficients may be stored and updated at a 
lower data rate than that used to generate the cyclic error basis functions from the 
feedforward values. In such cases, the stored values for the cyclic error coefficients may 
used for the calculation of the cyclic error basis functions as necessary. Of course, in yet 
further embodiments, the coefficients and/or the error basis functions can be calculated in 
real time, without the use of the feedforward signals. 



41 



Attorney Docket No. 09712-330001 
Client Reference No. Z-429 



An important assumption in the preferred embodiment of the cyclic error 



compensation described above is that the Doppler shift frequency d(p(t)/dt is small enough 
relative to the heterodyne frequency (o R , that the quadrature signal S(t) can be 



approximated (in analogy to Equation (10) by the expression: 



5(0 = (cot(D#T)S(/-2"0- 



cos2©/jx 
sincOflT 



S(r-T) 



(72), 



or the simpler expression given by Equation (12). In further embodiments, the quadrature 



where (a M is the instantaneous rate of change of the phase of the main interference signal 
S{t), which can be determined with sufficient accuracy by assuming that the cyclic error 
contributions to S{t) arenegligle. 

Also, in further embodiments, the cyclic error compensation technique may be used 
for cyclic errors terms different from those explicitly described in Equations (3)-(6). In 
particular, using algebraic combinations of the signals S(t) , S(t) , Sr (t) , and S R (t) , a 
processing unit can generate cyclic error basis functions, which are sine and cosine functions 
that have the same time-varying arguments as the cyclic error terms that need to be 
compensated, and then use the cyclic error basis functions to project out respective cyclic 
error coefficients from S(t) and S(t) by low-pas filtering (e.g., averaging). 

For example, to determine the coefficients for a half-cycle cyclic error of the form: 



signal S(t) may be more accurately calculated according to: 



S(t) = (cot co^t) S(t - 2t) - 



cos 2(0 m% 
sinco^x 



■S(t-x) 



(73), 




(74), 
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one can calculater cyclic error basis functions for the half-cycle cyclic error according as 

follows. First calculate signals Z 7 (/) and Z 7 (/) as: 



= (75) 



2^4 



Notably, the leading term of Z' 7 (/) is |cos((p/2 + £i/2-^/2)| , and the leading term of 
Z' 7 (/) is jsin (9/2 + £i/2-£/j/2)|. Zero phase crossings in Z 7 (/) and Z' 7 (r) are then 
measured to remove the absolute value operation and define Z 7 (t) and Z 7 (/) , which have 
leading terms cos (9/2 + ^ /2 - ^ /2) and sin ((p/2 + ^/2-^/2), respectively. 
Half-cycle error basis functions Z}/ 2 (') and (') are then generated as: 

Z l/2 (t) = Z 7 (t)S R (t)-l 7 (t)S R (t) (77) 
Z 1/2 (/) = Z 7 (0^( f ) + Z 7 (0^(r) (78) 

where Zyj (t) and Zj/2 (t) have as their leading terms sine and cosine functions, 
respectively, whose time-dependent argument is the same as that of ^(1/2) • To determine 
the coefficients for S^y 2 y one projects half-cycle error basis functions onto S(t) and S(t) 
to move the half-cycle cyclic error component to zero frequency, for example: 

S 8 (f) = 5(r)Z 1/2 (r)+5(r)Z 1/2 (r) (79) 



t s {t) = S(t)2 l/2 {t)-S(t)t l/2 {t) 

43 



(80). 



Attorney Docket No. 09712-330001 
Client Reference No. Z-429 



Low-pass filtering (e.g., with the Butterworth filter) of E 8 (/) and E 8 (/) then yield the half- 
cycle error coefficients in analogy to the extraction of the previously described cyclic error 
coefficients. In particular, the leading terms following the low-pass filtering are 

5 A R A ]/2 cos(<;i/2 -C,j2-^ R /2) and A R Ay 2 sin^-^i^-^^), respectively. 

Finally, it is noted that, if desired, the accuracy of the cyclic error correction can 
further be improved to higher order in the amplitude of the cyclic error coefficients by 
iterating the compensation of the main signal S(t) . In other words, for each subsequent 
iteration the compensated signal for the main signal is used to generate corresponding cyclic 
10 error basis functions and determine a higher order correction to each of the cyclic error 
coefficients. 

Referring now to FIG. 2, a block diagram is shown for an M th order digital filter used 
in the cyclic error compensation described above. The figure is in the "Direct Form I" 
representation standard to the digital signal processing community. The input discrete time 

15 series is x(n) and the output discrete time series is y{n) . The z~ l operation indicates a one- 
sample delay. A time-domain representation of the filter takes the form: 

y (n) = b 0 x(n) + b\x(n - 1) + 1>ix(n - 2) + ... + b M x(n - M) 
-a l y(n-\)-a 2 y(n-2)-...-a M y(n-M) 

The coefficients a t and b t are selected to produce the desired properties for the filter. For 
20 the case of the Butterworth filter, the a t and b t coefficients are selected to produce the 
frequency filtering given by Equations 28, 29, 43, 44, 55, and 56. Furthermore, other 
embodiments of the cyclic error compensaiton may implement different low-pass filtering 
schemes to yield the coefficients of low- frequency terms. The Butterworth filter, and other 
low-pass digital filters, are well known in the art. See, for example: Oppenhiem, A.V., 
25 Schafer, R. W., and , J.R. Buck, "Discrete-Time Signal Processing", Upper Saddle River, NJ: 
Prentice Hall, 1999; and Proakis, J.G., and , D.G. Manolakis, "Digital Signal Processing: 
Principles, Algorithms, and Applications", New York, NY: MacMillan, 1992. 
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Depending on the embodiment, the compensation technique described above can be 
implemented in hardware or software, or a combination of both. The technique can be 
implemented in computer programs using standard programming techniques following the 
method and figures described herein. Program code is applied to input data to perform the 
5 functions described herein and generate output information. The output information is 
applied to one or more output devices such as the servo control system. 

Each program may be implemented in a high level procedural or object oriented 
programming language to communicate with a computer system, or the programs can be 
implemented in assembly or machine language, if desired. In any case, the language can be a 

10 compiled or interpreted language. Moreover, the program can run on dedicated integrated 
circuits preprogrammed for that purpose. 

Each such computer program may be stored on a storage medium or device (e.g., 
ROM or magnetic diskette) readable by a general or special purpose programmable 
computer, for configuring and operating the computer when the storage media or device is 

15 read by the computer to perform the procedures described herein. The computer program 
can also reside in cache or main memory during program execution. The compensation 
technique can also be implemented as a computer-readable storage medium, configured with 
a computer program, where the storage medium so configured causes a computer to operate 
in a specific and predefined manner to perform the functions described herein. 

20 Now referring to FIG. 3, an interferometry system including a high stability plane 

mirror interferometer (HSPMI) 311 is shown for optical generating the main interference 
signal. The HSPMI 31 1 includes a polarization beam-splitter 330, a retroreflector 332, 
quarter wave phase retardation plates 334 and 336, and a plane mirror reference object 342. 
Input beam 422 is a two-component beam. The two components have different frequencies 

25 and are orthogonally plane polarized. The different frequencies can be produced in source 

415, for example, by laser Zeeman splitting, by acousto-optical modulation, or internal to the 
laser using birefringent elements or the like. HSPMI 311 splits input beam 422 into two 
components. One component, shown as first and second pass measurement beams 322 and 
324, reflects from measurement object 490 twice before exiting HSPMI 311. The other 

30 component, shown by first and second pass reference beams 328 and 327, reflect from 
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reference mirror 342 twice before exiting HSPMI 311. The exiting beam components 
overlap and form output beam 423. 

An electrical interference signal 352 is generated by the detection of output beam 423 
in detector 420. Detector 420 includes a polarizer to mix the reference and measurement 
5 beam components of output beam 423 with respect to polarization. Electrical interference 
signal 352 contains a heterodyne interference signal corresponding to main interference 
signal S(t) . 

In further embodiments, the interferometry system may be different than that shown 
in FIG 3. In particular, the cyclic error compensation technique is applicable to other types 

10 of interferometers as well. For example, the main interference signal S(t) may be produced 
by an interferometry system that may include any of single and/or multiple pass 
interferometers, passive interferometers, dynamic interferometers, and dispersion 
interferometers. Furthermore, the interferometry system may monitor one or more degrees of 
freedom, each of which may produce a corresponding main interference signal S(t) , which 

15 may be compensated for cyclic errors as disclosed herein. Furthermore, the degree(s) of 

freedom monitored by the interferometry system may include any of changes in distance to a 
measurement object, changes in relative distance between two measurement objects, changes 
in the angular orientation of a measurement object, and changes in the direction of the input 
beam. 

20 Examples of dynamic interferometers are described in U.S. Patent Application Serial 

No. 10/226,591 filed August 23, 2002 and entitled "DYNAMIC INTERFEROMETER 
CONTROLLING DIRECTION OF INPUT BEAM" by Henry A. Hill. Examples of passive 
zero shear interferometers are described in U.S. Patent Application Serial No. 10/207,314, 
entitled "PASSIVE ZERO SHEAR INTERFEROMETERS," filed July 29, 2002, by Henry 

25 A. Hill. Examples of angular displacement interferometers are described in: U.S. Patent 
Application Serial No. 10/226,591 entitled "DYNAMIC INTERFEROMETER 
CONTROLLING DIRECTION OF INPUT BEAM," filed August 23, 2002; U.S. Provisional 
Application 60/314,345 filed August 22, 2001 and entitled "PASSIVE ZERO SHEAR 
INTERFEROMETERS USING ANGLE SENSITIVE BEAM-SPLITTERS," both by Henry 

30 A. Hill, and U.S. Patent Application Serial No. 10/272,034 entitled "INTERFEROMETERS 
FOR MEASURING CHANGES IN OPTICAL BEAM DIRECTION" and filed October 15, 
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2002 by Henry A. Hill and Justin Kreuzer. Alternatively, or additionally, interferometry 
systems may include one or more differential angular displacement interferometers, examples 
of which are also described in U.S. Patent Application Serial No. 10/272,034. Examples of 
interferometry systems for measuring more than one degree of freedom and for reducing 

5 beam shear are described in U.S. Patent Application Serial No. 10/352,616 filed January 28, 

2003 and entitled "MULTIPLE-PASS INTERFEROMETRY" by Henry A. Hill and U.S. 
Patent Application Serial No. 10/351,708 filed January 27, 2003 and entitled "MULTI-AXIS 
INTERFEROMETER" by Henry A. Hill. Other forms of multiple pass interferometers are 
described in an article entitled "Differential interferometer arrangements for distance and 

10 angle measurements: Principles, advantages and applications" by C. Zanoni, VDI Berichte 
Nr. 749, 93-106 (1989). Examples of two-wavelength dispersion interferometers are 
described in U.S. Patent No. 6,219,144 Bl entitled "APPARATUS AND METHOD FOR 
MEASURING THE REFRACTIVE INDEX AND OPTICAL PATH LENGTH EFFECTS 
OF AIR USING MULTIPLE-PASS INTERFEROMETRY" by Henry A. Hill, Peter de Groot, 

15 and Frank C. Demarest and U.S. Patent No. 6,327,039 Bl by Peter de Groot, Henry A. Hill, 
and Frank C. Demarest. 

Because of the cyclic error compensation, the interferometry systems described 
herein provide highly accurate measurements. Such systems can be especially useful in 
lithography applications used in fabricating large scale integrated circuits such as computer 

20 chips and the like. Lithography is the key technology driver for the semiconductor 

manufacturing industry. Overlay improvement is one of the five most difficult challenges 
down to and below 100 nm line widths (design rules), see, for example, the Semiconductor 
Industry Roadmap, p.82 (1997). 

Overlay depends directly on the performance, i.e., accuracy and precision, of the 

25 distance measuring interferometers used to position the wafer and reticle (or mask) stages. 
Since a lithography tool may produce $50-100M/year of product, the economic value from 
improved performance distance measuring interferometers is substantial. Each 1% increase 
in yield of the lithography tool results in approximately $lM/year economic benefit to the 
integrated circuit manufacturer and substantial competitive advantage to the lithography tool 

30 vendor. 
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The function of a lithography tool is to direct spatially patterned radiation onto a 
photoresist-coated wafer. The process involves determining which location of the wafer is to 
receive the radiation (alignment) and applying the radiation to the photoresist at that location 
(exposure). 

5 To properly position the wafer, the wafer includes alignment marks on the wafer that 

can be measured by dedicated sensors. The measured positions of the alignment marks 
define the location of the wafer within the tool. This information, along with a specification 
of the desired patterning of the wafer surface, guides the alignment of the wafer relative to 
the spatially patterned radiation. Based on such information, a translatable stage supporting 
10 the photoresist-coated wafer moves the wafer such that the radiation will expose the correct 
location of the wafer. 

During exposure, a radiation source illuminates a patterned reticle, which scatters the 
radiation to produce the spatially patterned radiation. The reticle is also referred to as a 
mask, and these terms are used interchangeably below. In the case of reduction lithography, 

15 a reduction lens collects the scattered radiation and forms a reduced image of the reticle 

pattern. Alternatively, in the case of proximity printing, the scattered radiation propagates a 
small distance (typically on the order of microns) before contacting the wafer to produce a 
1 : 1 image of the reticle pattern. The radiation initiates photo-chemical processes in the resist 
that convert the radiation pattern into a latent image within the resist. 

20 Interferometry systems are important components of the positioning mechanisms that 

control the position of the wafer and reticle, and register the reticle image on the wafer. If 
such interferometry systems include the features described above, the accuracy of distances 
measured by the systems increases as cyclic error contributions to the distance measurement 
are minimized. 

25 In general, the lithography system, also referred to as an exposure system, typically 

includes an illumination system and a wafer positioning system. The illumination system 
includes a radiation source for providing radiation such as ultraviolet, visible, x-ray, electron, 
or ion radiation, and a reticle or mask for imparting the pattern to the radiation, thereby 
generating the spatially patterned radiation. In addition, for the case of reduction 

30 lithography, the illumination system can include a lens assembly for imaging the spatially 
patterned radiation onto the wafer. The imaged radiation exposes resist coated onto the 
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wafer. The illumination system also includes a mask stage for supporting the mask and a 
positioning system for adjusting the position of the mask stage relative to the radiation 
directed through the mask. The wafer positioning system includes a wafer stage for 
supporting the wafer and a positioning system for adjusting the position of the wafer stage 
relative to the imaged radiation. Fabrication of integrated circuits can include multiple 
exposing steps. For a general reference on lithography, see, for example, J. R. Sheats and B. 
W. Smith, in Microlithography: Science and Technology (Marcel Dekker, Inc., New York, 
1998), the contents of which is incorporated herein by reference. 

Interferometry systems described above can be used to precisely measure the 
positions of each of the wafer stage and mask stage relative to other components of the 
exposure system, such as the lens assembly, radiation source, or support structure. In such 
cases, the interferometry system can be attached to a stationary structure and the 
measurement object attached to a movable element such as one of the mask and wafer stages. 
Alternatively, the situation can be reversed, with the interferometry system attached to a 
movable object and the measurement object attached to a stationary object. 

More generally, such interferometry systems can be used to measure the position of 
any one component of the exposure system relative to any other component of the exposure 
system, in which the interferometry system is attached to, or supported by, one of the 
components and the measurement object is attached, or is supported by the other of the 
components. 

An example of a lithography scanner 1100 using an interferometry system 1126 is 
shown in Fig. 4. The interferometry system is used to precisely measure the position of a 
wafer (not shown) within an exposure system. Here, stage 1122 is used to position and 
support the wafer relative to an exposure station. Scanner 1100 includes a frame 1102, 
which carries other support structures and various components carried on those structures. 
An exposure base 1104 has mounted on top of it a lens housing 1106 atop of which is 
mounted a reticle or mask stage 1116, which is used to support a reticle or mask. A 
positioning system for positioning the mask relative to the exposure station is indicated 
schematically by element 1117. Positioning system 1117 can include, e.g., piezoelectric 
transducer elements and corresponding control electronics. Although, it is not included in 
this described embodiment, one or more of the interferometry systems described above can 
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also be used to precisely measure the position of the mask stage as well as other moveable 
elements whose position must be accurately monitored in processes for fabricating 
lithographic structures (see supra Sheats and Smith Microlithography: Science and 
Technology). 

5 Suspended below exposure base 1104 is a support base 1113 that carries wafer stage 

1122. Stage 1122 includes a plane mirror 1128 for reflecting a measurement beam 1154 
directed to the stage by interferometry system 1126. A positioning system for positioning 
stage 1122 relative to interferometry system 1126 is indicated schematically by element 
1119. Positioning system 1119 can include, e.g., piezoelectric transducer elements and 

10 corresponding control electronics. The measurement beam reflects back to the 

interferometry system, which is mounted on exposure base 1104. The interferometry system 
can be any of the embodiments described previously. 

During operation, a radiation beam 1110, e.g., an ultraviolet (UV) beam from a UV 
laser (not shown), passes through a beam shaping optics assembly 1112 and travels 

15 downward after reflecting from mirror 1114. Thereafter, the radiation beam passes through a 
mask (not shown) carried by mask stage 1116. The mask (not shown) is imaged onto a wafer 
(not shown) on wafer stage 1122 via a lens assembly 1108 carried in a lens housing 1106. 
Base 1104 and the various components supported by it are isolated from environmental 
vibrations by a damping system depicted by spring 1120. 

20 In other embodiments of the lithographic scanner, one or more of the interferometry 

systems described previously can be used to measure distance along multiple axes and angles 
associated for example with, but not limited to, the wafer and reticle (or mask) stages. Also, 
rather than a UV laser beam, other beams can be used to expose the wafer including, e.g., x- 
ray beams, electron beams, ion beams, and visible optical beams. 

25 In some embodiments, the lithographic scanner can include what is known in the art 

as a column reference. In such embodiments, the interferometry system 1126 directs the 
reference beam (not shown) along an external reference path that contacts a reference mirror 
(not shown) mounted on some structure that directs the radiation beam, e.g., lens housing 
1106. The reference mirror reflects the reference beam back to the interferometry system. 

30 The interference signal produce by interferometry system 1 126 when combining 

measurement beam 1154 reflected from stage 1122 and the reference beam reflected from a 
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reference mirror mounted on the lens housing 1106 indicates changes in the position of the 
stage relative to the radiation beam. Furthermore, in other embodiments the interferometry 
system 1126 can be positioned to measure changes in the position of reticle (or mask) stage 
1 1 16 or other movable components of the scanner system. Finally, the interferometry 
5 systems can be used in a similar fashion with lithography systems involving steppers, in 
addition to, or rather than, scanners. 

As is well known in the art, lithography is a critical part of manufacturing methods 
for making semiconducting devices. For example, U.S. Patent 5,483,343 outlines steps for 
such manufacturing methods. These steps are described below with reference to FIGS. 5(a) 

10 and 5(b). FIG. 5(a) is a flow chart of the sequence of manufacturing a semiconductor device 
such as a semiconductor chip (e.g., IC or LSI), a liquid crystal panel or a CCD. Step 1151 is 
a design process for designing the circuit of a semiconductor device. Step 1152 is a process 
for manufacturing a mask on the basis of the circuit pattern design. Step 1153 is a process 
for manufacturing a wafer by using a material such as silicon. 

15 Step 1154 is a wafer process which is called a pre-process wherein, by using the so 

prepared mask and wafer, circuits are formed on the wafer through lithography. To form 
circuits on the wafer that correspond with sufficient spatial resolution those patterns on the 
mask, interferometric positioning of the lithography tool relative the wafer is necessary. The 
interferometry methods and systems described herein can be especially useful to improve the 

20 effectiveness of the lithography used in the wafer process. 

Step 1155 is an assembling step, which is called a post-process wherein the wafer 
processed by step 1154 is formed into semiconductor chips. This step includes assembling 
(dicing and bonding) and packaging (chip sealing). Step 1156 is an inspection step wherein 
operability check, durability check and so on of the semiconductor devices produced by step 

25 1155 are carried out. With these processes, semiconductor devices are finished and they are 
shipped (step 1157). 

FIG. 5(b) is a flow chart showing details of the wafer process. Step 1161 is an 
oxidation process for oxidizing the surface of a wafer. Step 1162 is a CVD process for 
forming an insulating film on the wafer surface. Step 1163 is an electrode forming process 

30 for forming electrodes on the wafer by vapor deposition. Step 1 164 is an ion implanting 
process for implanting ions to the wafer. Step 1165 is a resist process for applying a resist 
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(photosensitive material) to the wafer. Step 1166 is an exposure process for printing, by 

exposure (i.e., lithography), the circuit pattern of the mask on the wafer through the exposure 

apparatus described above. Once again, as described above, the use of the interferometry 

systems and methods described herein improve the accuracy and resolution of such 

5 lithography steps. 

Step 1167 is a developing process for developing the exposed wafer. Step 1168 is an 

etching process for removing portions other than the developed resist image. Step 1 169 is a 

resist separation process for separating the resist material remaining on the wafer after being 

subjected to the etching process. By repeating these processes, circuit patterns are formed 

1 o and superimposed on the wafer. 

The interferometry systems described above can also be used in other applications in 
which the relative position of an object needs to be measured precisely. For example, in 
applications in which a write beam such as a laser, x-ray, ion, or electron beam, marks a 
pattern onto a substrate as either the substrate or beam moves, the interferometry systems can 

1 5 be used to measure the relative movement between the substrate and write beam. 

As an example, a schematic of a beam writing system 1200 is shown in FIG. 6. A 
source 1210 generates a write beam 1212, and a beam focusing assembly 1214 directs the 
radiation beam to a substrate 1216 supported by a movable stage 1218. To determine the 
relative position of the stage, an interferometry system 1220 directs a reference beam 1222 to 

20 a mirror 1224 mounted on beam focusing assembly 1214 and a measurement beam 1226 to a 
mirror 1228 mounted on stage 1218. Since the reference beam contacts a mirror mounted on 
the beam focusing assembly, the beam writing system is an example of a system that uses a 
column reference. Interferometry system 1220 can be any of the interferometry systems 
described previously. Changes in the position measured by the interferometry system 

25 correspond to changes in the relative position of write beam 1212 on substrate 1216. 

Interferometry system 1220 sends a measurement signal 1232 to controller 1230 that is 
indicative of the relative position of write beam 1212 on substrate 1216. Controller 1230 
sends an output signal 1234 to a base 1236 that supports and positions stage 1218. In 
addition, controller 1230 sends a signal 1238 to source 1210 to vary the intensity of, or block, 

30 write beam 1212 so that the write beam contacts the substrate with an intensity sufficient to 
cause photophysical or photochemical change only at selected positions of the substrate. 
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